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This listing of claims will replace all prior versionSp and llstlngB, of claims in 
the application: 



1 , (Currently amended) A method, comprising; 

identifying a loop in a program; 

Identifying each vector memory reference in the loop; 

determining dependencies between vector nriemory references in the loop, 
including determining unidirectional and circular dependencies; and 

reducing cache thrashing by distributing the vector memory references into 
a plurality of detail loops configured to allocate the vector memory 
references into_a_p_lLirality of temporary arrays, sized and located, so 
that none, of the vector memory references are cache synonyms , 
wherein the vector memory references that have circular 
dapendancies therebetween are included in a common detail loop, 
and wherein the detail loops are ordered according to the 
unidirectional dependencies between the memory references. 



2. (Original) A method, as set forth in claim 1, further comprising allocating a 
plurality of temporary storage areas within a cache and determining the size of 
each temporary storage area based on the size of the cache and the number of 
temporary storage areas. 

3. (Original) A method, as set forth in claim 1 , further comprising at least one 
section loop including the plurality of detail loops. 

4. (Original) A method, as set forth tn claim 1, wherein distributing the vector 
memory references Into a plurality of detail loops further comprises distributing 
the vector memory references into a plurality of detail loops that each contain at 
least one vector memory reference that could benefit from cache management. 
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5. (Original) A method, as set forth in claim 1, further comprising Inserting 
cache management instructions into at least one of said detail loops to control 
movement of data associated wrth the vector memory reference between a cache 
and main memory. 

6. (Original) A method, as set forth in daim 1, further comprising inserting 
prefetch instructions into at least one of said detail loops to control movement of 
data associated with the vector memory reference between a cache and main 
memory. 

7. (Original) A method, as set forth in claim 1, further comprising performing 
loop unrolling on at least one of said detail loops to control movement of data 
associated with the vector memory reference between a cache and main 
memory. 

8. (Original) A method, as set forth In claim 1 , further comprising inserting at 
least one of a prefetch instruction and a cache management instruction into at 
least one of said detail loops to control movement of data associated with the 
vector memory reference between a cache and main memory, and performing 
loop unrolling on at least one of said detail loops to control movement of data 
associated vwth the vector memory reference between a cache and main 
memory. 

9. (Currently amended) A method, comprising: 
identifying a loop in a program; 

identif/ing each vector memory reference in the loop: 
determining dependencies between vector memory references in the 
loop; and 

reducing cache thrashing by distributing the vector memory references into 
a plurality of detail loops that serialiv proceed throug h strips of the 
vector memory references and store the strips In tempo rary arrays 
so that none of the vector memo ry references are cache svnonvms. 
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wherein the vector memory references that have dependencies 
therebetween are included In a common detail loop. 

10. (Original) A method, as set forth In dalm 9, further comprising allocating a 
plurality of temporary storage areas within a cache and determining the size of 
each temporary storage area based on the size of the cache and the number of 
temporary storage areas. 

1 1 . (Original) A method, as set forth in claim 9, further comprising at least one 
section loop including the plurality of detail loops. 

12. (Original) A method, as set forth in claim 9, wherein distributing the vector 
memory references into a plurality of detail loops further comprises distributing 
the vector memory references into a plurality of detail loops that each contain at 
least one vector memory reference that could benefit from cache management. 

13. (Original) A method, as set forth In claim 9, further comprising inserting 
cache management instmctlons into at least one of said detail loops to control 
movement of data associated with the vector memory reference between a cache 
and main memory. 

14. (Original) A method, as set forth in claim 9, further comprising inserting 
prefetch instructions into at least one of said detail loops to control movement of 
data associated with the vector memory reference between a cache and main 
memory. 

15. (Original) A method, as set forth in claim 9, further comprising performing 
loop unrolling on at least one of said detail loops to control movement of data 
associated with the vector memory reference between a cache and main 
memory. 

18. (Original) A method, as set forth in claim 9, further comprising inserting at 
least one of a prefetch instruction and a cache management Instruction into at 
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least one of said detail loops to control movement of data associated with the 
vector memory reference between a cache and main memory, and performing 
loop unrolling on at least one of said detail loops to control movement of data 
associated with the vector memory reference between a cache and main 
memory. 

17. (Currently amended) A method, comprising: 
identifying a loop in a program; 

identifying each vector memory reference in the loop; 
determining dependencies between vector memory references In the 
loop; and 

reducing cache thrashing by distributing the vector memory references into 
a plurality of detail loops in response to cache behavior and the 
dependencies between the vector memory references In the loop^ 
wherein the detail loops cause storage of the vector memory 
references in temporary arrays that are allocated consecutively so 
that no temporary arrays elements are cache synonyms . 

18. (Original) A method, as set forth in daim 17, wherein distributing the 
vector memory references further comprises distributing the vector memory 
references into the plurality of detail loops with each loop having at least one of 
the identified vector memory references. 

19. (Original) A method, as set forth in claim 17, furttier comprising 
determining dependencies between vector memory references in the loop, and 
wherein distributing the loop includes distnbuting the vector memory references 
into the plurality of detail loops, wherein the vector memory references that have 
circular dependencies therebetween are Included In a common detail loop. 

20. (Original) A method, as set forth In claim 17, further comprising Inserting 
cache management instructions into at least one of said detail loops to control 
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movement of data aeeociated with the vector memory reference between a cache 
and main memory. 

21. (Original) A method, as set forth In claim 17, further comprteing inserting 
prefetch instructions into at least one of said detail loops to control movement of 
data associated with the vector memory reference between a cache and main 
memory. 

22. (Original) A method, as set forth In claim 17, further comprising performing 
loop unrolling on at least one of said detail loops to control movement of data 
associated with the vector memory reference between a cache and main 
memory. 

23. (Original) A method, as set forth in claim 17^ further comprising inserting 
at least one of a prefetch instruction and a cache management instruction into at 
least one of said detail loops to control movement of data associated with the 
vector memory reference between a cache and main memory, and performing 
loop unrolling on at least one of said detail loops to control movement of data 
associated with the vector memory reference between a cache and main 
memory. 

24. (Currently amended) A computer programmed to perform a method, 
comprising: 

Identifying a loop In a program; 
I identrfying each vector memory reference in the loop; 

determining dependencies between vector memory references in the 
loop; and 

reducing cache thrashing by distributing the vector memory references into 
a plurality of detail loops configured to retrieve strips of the vector 
memon/ references and store the strips in temporary affavs. 
wherein the vector memory references that have circular 
dependencies therebetween are included in a common detail loop. 

1MSfla.01/1BB2 484O0 Pafle 6 Of 1 6 MP PDNO 200501732-1 

PAGE 11/21 ' RCVD AT 10114/2004 4:45:56 PM [Eastern Daylight Time]' SVR:USPTO-EFXRF»1IO' DNIS:8729306 ' CSID:71323m ' DURATION M:06"20 



10/14/2004 



15:50 FAX 



7132388008 



(21012/021 



AppL No. 09/785.143 

Amdt datad October 14, 2004 

Reply to Office action of July 15, 2004 

25. (Currently amended) A program storage medium encoded with 
inatructions that, when executed by a computer, perform a method, comprising: 

identifying a loop In a program; 
Identifying each vector memory reference in the toop; 
determining dependencies between vector memory references in the 
loop; and 

reducing cache thrashing by distributing the vector memory references into 
a plurality of detail loops configured to allocate the vector memory 
references into temporary arrays that_avQld cache synonyms , 
wherein the vector memory references that have circular 
dependencies therebetween are included In a common detail loop. 

26. (Currently amended} A compiler, comprising: 
means for identifying a loop in a program; 

means for identifying each vector memory reference In the loop; 

means for determining dependencies between vector memory references 
In the loop, including determining unidirectional and circular dependencies; and 

means for reducing cache thrashing by distributing the vector memory 
references Into a plurality of detail loopa _configured to seriallv proce ss strips of 
the vector memory references_.sQ_J hat thrashing does not occur, wherein the 
vector memory references that have circular dependencies therebetween are 
included in a common detail loop, and wherein the detail loops are ordered 
according to the unidirectional dependencies between the memory references. 

27. (Original) A compiler, as set forth In claim 26, further comprising means 
for allocating a plurality of temporary storage areas within a cache and 
determining the size of each temporary storage area based on the size of the 
cache and the number of temporary storage areas. 

26. (Original) A compiler, as set forth in claim 26, wherein the means for 
distributing the vector memory references Into a plurality of detail loops further 
comprises distributing the vector memory references Into a plurality of detail loops 
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that each contain at least one vector memory reference that could benefrt from 
cache management. 

29. (Original) A compiler, as set forth in claim 26, further comprising inserting 
cache management instructions into at least one of said detail loops to control 
movement of data associated with the vector memory reference between a cache 
and main memory. 

30. (Original) A compiler, as set forth in claim 28, further comprising means 
for inserting prefetch instructions into at least one of said detail loops to control 
movement of data associated with the vector memory reference between a cache 
and main memory. 

31. (Original) A compiler, as set forth in claim 26. further comprising means 
for perfonming loop unrolling on at least one of said detail loops to control 
movement of data associated with the vector memory reference between a cache 
and main memory. 

32. (Original) A compiler, as sat forth In claim 26. further comprising means 
for Inserting at least one of a prefetch Instruction and a cache management 
instruction into at least one of said detail loops to control movement of data 
associated with the vector memory reference between a cache and main 
memory, and peritamiing loop unrolling on at least one of said detail loops to 
control movement of data associated with the vector memory reference between 
a cache and main memory. 

33. (Currency amended) A method for reducing the likelihood of c^che 
thrashing by software to be executed on a computer system having a cache, 
comprising: 

executing the software on the computer system; 

generating a profile indicating the manner In which the software uses the 

cache; 
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identifying a portion of the software tliat exiiibits cache thrashing leased on 
I w t ng the profiie dat a that may e xp e ri e nc e oaohe thrash i ng ; and 

modifying the identified portion of the software to reduce the likelihood of 
cache thrashing bv distributing cache svnonvms into d etail ioops configured to 
allocate the cache svnonvms into temoorarv atoraoe areas, sized and located, to 
ofeviantcacheJhrji&hjng . 

34. (Previously presented) A method, as set forth in claim 33, wherein 
modifying the identified portion of the software to reduce the likelihood of cache 
thrashing further comprises: 

identifying a loop In the identified portion of the software; 

identifying each vector memory reference In the identified loop; 

determining dependencies between the vector memory references in the 
identified loop of the software, including determining unidirectional and circular 
dependencies; and 

reducing cache thrashing by distributing the vector memory references into 
a plurality of detail loops, wherein the vector memory references that have 
circular dependencies therebetween are included in a common detail loop, and 
wherein the detail loops are ordered according to the unidirectional dependencies 
between the memory references. 

35. (Currently amended) A method for reducing the likelihood of cache 
thrashing by software to be executed on a computer system having a cache, 
comprising: 

executing the software on the computer system; . 

generating a profile indicating the manner In which the memory references 
of the software use the cache; 

identifying a first and oooo nd portion of the memory references based on 
the profile, wherein th e first portion of the memory references is detemnlned to 
caus o mav bo oxporiono i ng cache thrashing; and 

reducing cache thrashing by distrlbutlng - a t l east - a portion of the ftfst 
portion of the memory references into distinct loops that allocate strips of the 
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memorY refer^np^? IntQ t^mpprary art^ v fftQf ^ ;>^e pu ti p n , and p l oo i ng qt loaot the 
se cond - pQFt l oFKrf - th e m e mory r e ter e nooo i nto the di s t i nct l oops . 

36. (New) The oomputer of claim 24 wherein the temporary arrays are 
allocated consecutively such that no temporary array elements are cache 
synonyms. 

37. (New) The computer of claim 24 wherein the details loops are allocated 
into section loops that cause Iterative execution of the detail loops based on a size 
of the strips. 

38. (New) The program storage medium of claim 25 wherein the temporary 
arrays are allocated consecutively such that no temporary array elements are 
cache synonyms. 

39. (New) The method of claim 35 wherein the temporary arrays are located 
and sized to reduce cache thrashing. 

40. (New) The method of claim 35 wherein the temporary arrays are allocated 
consecutively and iteratively executed by a size of the strips. 
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